International Journal of Scientific and Technical Research in Engineering (IJSTRE) 
www.ijstre.com Volume 3 Issue 2 II March 2018. 


Using Partitioned Design Matrices in Analyzing Nested-Factorial 

Experiments 


Sigit Nugroho 


Department of Statistics, University of Bengkulu. 


/j- ■ LW 

Abstract. Using QR Decomposition to calculate the sum of squares of a model has a limitation that the number of rows, 
which is also the number of observations or responses, has to be greater than the total number of parameters used in the 
model. The main goal in the experimental design model, as a part of the Linear Model, is to analyze the estimable function 
of the parameters used in the model. In order not to deal with generalized invers, partitioned design matrix may be used 
instead. This partitioned design matrix method may be used to calculate the sum of squares of the models whenever the total 
number of parameters is greater than the number of observations. It can also be used to find the degrees of freedom of each 
source of variation components. This method is discussed in a Balanced Nested-Factorial Experimental Design. 
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I. INTRODUCTION 

m-, ■ ‘ n I 

A factorial experiment is an experiment in which all levels of a given factor are combined with all levels of 
every other factor in the experiment. There will be an interaction between the two factors when a change in one 
factor produces a different change in the response variabel at one level of another factor than at other levels of 
this factor. Some of the advantages of a factorial experiment can be seen : (1) more efficiency is possible than 
with one-factor-at-a-time experiments, (2) all data are used in computing both effects, (3) some information is 
gleaned on possible interaction between the two factors (Hicks, 1982) 

In a factorial experiment the effects of a number of different factors are investigated simultaneously. The 
treatments consist of all combinations that can be formed from different factors. Some instances where factorial 
experimentation may be suitable : (1) in exploratory work where the object is to determine quickly the effects of 
each of a number of factors over a specified range, (2) in investigations of the interactions among the effects of 
several factors, since information is best obtained by testing all combinations, (3) in experiments designed to 
lead to recommendations that must apply over a wide range of conditions (Cochran & Cox, 1957). 

Factorial treatment structure exists when the g treatments are the combinations of the levels of two or more 
factors. We call these combination treatments factor-level combinations or factorial combinations to emphasize 
that each treatment is a combination of one level of each of the factors. (Oehlert, 2010). 

A factor is said to be nested within a second factor if each of its levels is observed in conjunction with just 
one level of the second factor. Nested factors are usually, but not always, random effects, and they are usually, 
but not always, blocking factors (Dean & Voss, 1999). If each level of factor A contains different levels of 
factor B, we say that factor B is nested within factor A. Experiments with two or more factors satisfying this 
definition are called nested factor experiments or nested design factor (Lentner & Bishop, 1986). 

The factorial and nested designs are mostly used in manufacturing, processing, and the improvement of 
products (Longford, 1987; Rundan and Searle, 1991; Townsend and Searle, 1971) 

QR Decomposition could be used to find the sum of squares of its source of variation’s components as long 
as the number of rows of the matrix is at least equal to the number of columns. Thus, in the balanced nested- 
factorial design QR, decomposition cannot be used whenever rabc < 1+a+b+ab+bc+abc. Partitioning the 
design matrix with respect to source of variation’s component could help calculating their sum of squares and 
determining the rank of its partitioned matrices which are also their degrees of freedom. Using classical-sigma 
notation to calculate sum of squares of each source of variation is not as easy as using the partitioned matrix 
method to do so. 
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II. 


NOTATION 


Consider the Balanced Nested-Factorial model as follows 

Y m =l l+a ' + Pj + r(P) kU ) +a r(/ J )m +E m) (1) 

where Y i]kl = the l-th observation having i-th level effect of factor A, j-lh level effect of facor B, and k-th level 
effect of factor C which is nested in j-th level of factor B; p is the overall mean; a, the effect of i-th level of 
factor A i=l,2,...,a; ftj the effect of /-th level of factor B j=l,2,...,b\ af}jj the effect of interaction i-th level of 
factor A and /-th level of factor B; y([j) k i n the effect of k- th level of factor C which is nested in the/-th level of 
factor B; k=l,2,...,c, uy(P)ik(j) the interaction component for i-th level of A and k-th level of C nested in /-th level 
of B; and the experimental error 1=1,2,...,r. It is assumed that Si^'s are i.i.d. N(0, <r 2 ), ;/(/i) /c( /) ',vare 

i.i.d. N (0,<r 2 ), and ay(P) ik(j) 's are i.i.d. N( 0,<r 2 r ). £ l<ijk) 's , y(P) kU) ’s and ay(/3) ik(j) 's are mutually 
independents. 

The sum of squares calculations for a Balanced Nested-Factorial are 


CF = 




Y ijkl 


rabc 
TotalSS = 


- = correction factor 


i,j,kJ 


Y 2 
Y ijkl 


CF 


[A] = — Yj} -CF 
rbc i 

SS[B] = — XT/ -CF 
rac j 

SS[AB] = — y K 2 -CF-SSforA-SSforB 
rc 

i-j 

SS[C(B)] = - l -YY 2 k -CF-SSforB 
rab 


( 2 ) 


sswc(fl)]=2 


r . , 

j V l ’ k 


rac 


ErrorSS = TotalSS - SS [A] - SS [5] -SS[AB] - SS[C(B)} - SS\ AC(B)\ 


III. LINEAR MODELS IN MATRIX NOTATION 

A typical linear model considered is 

Y=xp + s ( 3 ) 

where Y is an n X 1 vector of observations, A is an n X p matrix of known constants called the design matrix, /? 
is a p X 1 vector of unobservable parameters, and s is an n X 1 vector of unobservable random errors. It is 
assumed that E( s ) = 0 and Covf s) = cr 2 I. 

The design matrix X for the balanced nested-factorial experiments in (3) having size rabc X 
(1+a+b+ab+bc+abc), is partitioned according to its source of variation components : Constants, Main Effect of 
Factor A, Main Effect of Factor B, Interaction Effect of Factor A and B, Main Effect of Factor C which is 
nested in Factor B, and Interaction Effect of Factor A and C which is nested in Factor B. Let 
X m =l ax i®kxi (g) l C xi (8) lrxi be the constant design matrix, =I axa ®l foxl ®l cxl ®l,. xl the main effect of 
factor A design matrix, Xp = l axl ®! cxl ®! rxl the main effect of factor B design matrix, 

Xa/3 — I axa ® Ibxb ®lcxt®hxl the interaction effect between factor A and factor B design matrix, 
X Y = l axl ® I hxh ®I CXC ® I rx | the main effect of factor C which is nested in factor B design matrix, and 
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X ay = I axa ® I hxh ® I CYC <S>l ( . xl the interaction effect of factor A and factor C which is nested in factor B 
design matrix, then X = \x u \X a \X /3 \ X afi I X y I X ay \. 

Furthermore, the projection matrix has the form of M, = X,(X',X,) ! X,' . Therefore, for each partitioned 
design matrix, it is easily to verify that the projection matrices with respect to each of the source of variations 
are as follows: 

Jaxa ® hxb ® Jcxc ® J r*r > 

rabc 

M a = -J- I axa ® J bxb ® J cxc ® J rxr , 

rbc 

J axa ® I bxb ® J cxc ® / rxr , 

rac 

M a p = — / aXfl ® / fcxfc ® / cxc ® J rxr , 

rc 

A* r = — Jaxa ® ® Uxc ® Jrvr , and 

ra 

M ay = ®I bxb ®I cxc ®J rxr 

r 

Using the properties of Kronecker product, we can easily find simple form every combination of projection 
matrix multiplication. The results of these multiplications are summarized as in Table 1. 


IV. SUM OF SQUARES IN A MATRIX NOTATION 

In terms of matrix notation, the formulas for calculating sum of squares for balanced nested-factorial 
experiment as mentioned in (2), can be written as follows: 

SS[A]= Y'(M a ~M m )Y , 

SS[B]= Y'(Mp ~M m )Y_ , 

SS[AB] = T(M a p-M a -Mp+M M )Y, 

SS[C(B)]= T{M y -Mp)Y_ , 

SS[AC(B)] = Y'(M ay -M ap -M y +Mp)Y, 

Error SS = Y(I-M ay )Y and 
Total SS = Y' (I - M M )Y . 

Let the n X 1 random vector Y be distributed N(y : p, I). The random variable JJ = Y’AY is distributed as •/_’ 
(xv.K;l) with K degrees of freedom and noncentrality parameter 1, where ). = /r ’Ap/2, if and only if A is an 
idempotent matrix of rank K (Graybill, 2000). 

It can be easily verified, using the result presented in Table 1, that M p - M u , 

AT a j 3 — AT a — AT p -+- AT , M y —Mp, M ay —M a p—M y +M p and I—M ay are all idempotent 

matrices. In additions to those idempotent properties, they are also symmetric matrices. From the properties of 
symmetric and idempotent matrices, their ranks are just equal to their traces (Rencher and Schaalje, 2008). 
Thus, their ranks are 
tr(M a -M M )= a-1, 

tr(M p —M /u )= b-1, 

tr(M a p — M a —Mp + M M ) = ab-a-b+1 = (a-1 )(b-l), 
tr(M y -Mp) = bc-b = b(c-l), 

tr(M ay —M a p —M y + Mp) = abc-ab-bc+b = (a-l)b(c-l) and 
tr{l —M ay )= rabc-abc = (r-l)abc respectively. 

Using the above arguments, and without loss of generality that the random vector Y be distributed N(y : 0, 
I), therefore the distributions of sum of squares are as follows: 

SS[A] is distributed as Xa-i > 
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SS[B] is distributed as Xb-\ > 

SS[AB] is distributed as Xui-ixb-i) > 

SS[C(B)] is distributed as Xb(c-i) > 

SS[AC(B)] is distributed as xf a -\)b(c-\) > and 
Error 55 is distributed as ^ . 


V. HYPOTHESIS TESTING 

Mean of Square is defined as Sum of Square divided by its degrees of freedom. Therefore, we have the 
followings : 

MS [A ]=T (M a - M m )Y /(a - 1) , 

MSI BI = Y{M p -M M )YKb-\), 

MS[AB] = Y'(M a p-M a -M p + M M )YI({a-\)(b-X)) , 

MS[C(B)] = Y’(M r -M p )Y/(b(c- 1)) , 

MS[AC(B)J = Y\M ay -M ap -M y +M p )Y/((a-l)b(c-Vj) and 
MS [Error] =Y'(I - M ay )Y /((r -1 )abc ). 

We need to know first the expected mean squares (EMS) for each of the source of variation in the balanced 
nested-factorial design. 

EMS for Error = u 2 , 

EMS forAC(B) = cr 2 + r<j 2 y , 

EMS for C(B) = a 2 + raa 2 , 

EMS for AB = a 2 + ra 2 y + Y (M ap -M a -M p +M M )Y /((a - \)(b - 1)) , 

EMS for B =cr 2 + raa 2 + T (M p - M M )Y/(b - 1) dan 
EMS for A = cr 2 + rcr ay + Y (M a - M ^ )Y t(a - 1). 

We know that if W is distributed as chi-square with a degrees of freedom, Z is distributed as chi-square with 
b degrees of freedom, W and Z are independent to each other, then (W/a)/(Z/b) is distributed as F with a and b 
degrees of freedom. 

To check the independence of two matrices W and Z, we need to show that WZ = O. Using the information 
in Table 1, it is easy to verify that (M ay — M ap — M y +M p ){I — M ay ) = 0, therefore 

(M ay — M ap — M y +M p ) and ( I-M ay ) are independent; (M y -M p )(I-M ay ) =0, therefore ( M y -M p ) 
and (I-M ay ) are independent; {M ap —M a —M p + M M )(M ay — M ap —M y + M p ) = 0, therefore 
(M ap — M a ~M p +M^) and ( M ay -M ap -M y +M p ) are independent; {M p -M )A ){M y -M p ) = 0, 
therefore (M p - M u ) and (M y - M p ) are independent; and (M r/ - M fl )(M ay - M ap - M y + M p ) = 0, 
therefore (M a —M^) and (M ay —M ap —M y +M p ) are independent also. 

Therefore, we have the following results : 

• to test if there is a significant main effect of factor A is to reject the null hypothesis whenever 
MS[A]/MS[AC(B)] is large enough. MS[A]/MS[AC(B)] is distributed as F with a -1 and (a-l)b(c-l) 
degrees of freedom. 

• to test if there is a significant main effect of factor B is to reject the null hypothesis whenever 
MS[B]/MS[C(B)] is large enough. MS[B]/MS[C(B)] is distributed as F with b -1 and b(c-l) degrees of 
freedom. 

• to test if there is a significant AB interaction effect is to reject the null hypothesis whenever 
MS[AB]/MS[AC(B)] is large enough. MS[AB]/MS[AC(B)] is distributed as F with (a-l)(b-l) and (a- 
1 )b(c-l) degrees of freedom. 
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• to test if there is a significant variance of C which is nested in B is to reject the null hypothesis 
whenever MS[C(B)]/MS[Error] is large enough. MS[C(B)]/MS[Error] is distributed as F with b(c-l) 
and <r-1 )abc degrees of freedom. 

• to test if there is a significant variance of AC interaction where C is nested in B is to reject the null 
hypothesis whenever MS[AC(B)]/MS[Error] is large enough. MS[AC(B)]/MS[Error] is distributed as F 
with (a-l)b(c-l) and (r-1 )abc degrees of freedom. 


VI. EXAMPLE 


R routine which can be used to analyze balanced nested-factorial experimental data is provided in the 
Appendix. The following example is taken from Flicks (1982) : An investigator wished to improve the number 
of rounds per minute that could be fired from a Navy gun. Fie devised a new loading method, which he hoped 
would increase the number of rounds per minute when compared with the existing method of loading. To test 
this hypothesis he needed teams of men to operate the equipment. As the general physique of a man might 
affect the speed with which he could handle the loading of the gun, he chose teams of men in three general 
groupings- slight, average, and heavy or rugged men. The classification of such men was on the basis of an 
Armed Services classification table. He chose three teams at random to represent each of three physique 
groupings. Each team was presented with the two methods of gun loading in a random order and each team used 
each method twice. 


Group 


I 



II 



III 


Team 

1 

2 

3 

4 

5 

6 

7 

8 

9 

Method 1 

20.2 

26.2 

23.8 

22.0 

22.6 

22.9 

23.1 

22.9 

21.8 


24.1 

26.9 

24.9 

23.5 

24.6 

25.0 

22.9 

23.7 

23.5 

Method 2 

14.2 

18.0 

12.5 

14.1 

14.0 

13.7 

14.1 

12.2 

12.7 


16.2 

19.1 

15.4 

16.1 

18.1 

16.0 

16.1 

13.8 

15.1 


The R routine results the following output. 


[1,] 

[,U [,2] 

"Source" "Deg Frdm" 

[,3] 

SS" 

[ ,4] 
"MS" 

[,5] 

"P" 

[2,] 

"Method" 

M J M 

651.951" 

"651.951" 

"364.83" 

[3,] 

"Group" 

"2" 

"16.052" 

"8.026" 

"1.227' 

[4,1 

"Method x Group" 

"2" 

"1.187" 

"0.594" 

"0.332" 

[5,1 

"Team(Group)" 

"6" 

"39.258" 

"6.543" 

"2.831" 

[6,] 

"Method x Team(Group)" 

"6" 

"10.722" 

"1.787" 

"0.773" 

[7,1 

"Error" 

T8" 

"41.59" 

"2.311" 

M M 

[8,1 

"Total" 

"35" 

"760.76" 

M M 

M I 

Note that 

Fo.05;1;6 = 5.98, F 0 .05 ; 2;6 = 5.14, 
Fq.01;1;6 = 13.74, Fq.01;2;6 = 10.92, 

Fo.05;6;18 

Fq.01;6;18 

= 2.66 
= 4.01 




There are significant differences in Method effects at 1% level, and significant variance of Team effects 
which is nested in Group at 5% level, while the other source of variations are not significantly different. 
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APPENDIX 


: R routine for Balanced Nested-Factorial Experiments i 


##### Given the Data ##### 

#### a is the level size of factor A (Methods) 

#### b is the level size of factor B (Groups) 

#### c is the level size of factor C (Teams) which is nested in factor B (Groups) 
#### r is the number of replications 
a <-2 
b <-3 
c <-3 
r <-2 


##### Observed Data ##### 

y <-rbind(20.2, 24.1, 26.2, 26.9, 23.8, 24.9, 22.0, 23.5, 22.6, 
24.6, 22.9, 25.0, 23.1, 22.9, 22.9, 23.7, 21.8, 23.5, 
14.2, 16.2, 18.0, 19.1, 12.5, 15.4, 14.1, 16.1, 14.0, 
18.1, 13.7, 16.0, 14.1, 16.1, 12.2, 13.8, 12.7, 15.1) 

##### Basic Vectors and Matrices ##### 

va <- matrix! 1,a, 1) #vektor la 

vb <- matrix! 1 ,b, 1) #vektor lb 

vc <- matrix! l,c,l) #vektor lc 

vr <- matrix( 1 ,r, 1) #vektor lr 

la <- diag(l,a,a) #identitas a 

lb <- diag(l,b,b) #identitas b 

Ic <- diag(l,c,c) #identitas c 


##### Partitioned Design Matrices ##### 

Xmu <- kronecker(va,kronecker(vb,kronecker(vc,vr))) 
Xm <- kronecker(Ia,kronecker(vb,kronecker(vc,vr))) 
Xg <- kronecker(va,kronecker(Ib,kronecker(vc,vr))) 
Xmg <- kronecker(Ia, kronecker(Ib,kronecker(vc,vr))) 
Xt <- kronecker(va,kronecker(Ib,kronecker(Ic,vr))) 
Xmt <- kronecker(Ia,kronecker(Ib,kronecker(Ic,vr))) 


##### Projection Matrices ##### 

Mmu <-(Xmu %*%(solve(t(Xmu)%*%Xmu)))%*% t(Xmu) 
Mm <-(Xm %*%(solve(t(Xm)%*%Xm)))%*% t(Xm) 

Mg <-(Xg %*%(solve(t(Xg)%*%Xg)))%*% t(Xg) 

Mmg <-(Xmg %*%(solve(t(Xmg)%*%Xmg)))%*% t(Xmg) 
Mt <-(Xt % * %(solve(t(Xt)% * %Xt)))% * % t(Xt) 

Mmt <-(Xmt %*%(solve(t(Xmt)%*%Xmt)))%*% t(Xmt) 
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##### Calculating Sum of Squares ##### 

SSMethod <- round(t(y)%*%(Mm-Mmu)%*%y, digits=3) 

SSGroup <- round(t(y)%*%(Mg-Mmu)%*%y, digits=3) 

SSMethodxGroup <- round(t(y)%*%(Mmg-Mm-Mg+Mmu)%*%y,digits=3) 

SSTeamwGroup <- round(t(y)%*%(Mt-Mg)%*%y, digits=3) 

SSMethodxTeamwGroup <- round(t(y)%*%(Mmt-Mmg-Mt+Mg)%*%y,digits=3) 

SSError <- round(t(y)%*%(diag(l,a*b*c*r,a*b*c*r)-Mmt)%*%y,digits=3) 

SSTotal <- round(t(y)%*%(diag(l,a*b*c*r,a*b*c*r)-Mmu)%*%y,digits=3) 

##### Calculating Means of Squares ##### 
library (psych) 

MSMethod <- round(SSMethod/tr(Mm-Mmu),digits=3) 

MSGroup <- round(SSGroup/tr(Mg-Mmu),digits=3) 

MSMethodxGroup <- round(SSMethodxGroup/tr(Mmg-Mm-Mg+Mmu),digits=3) 
MSTeamwGroup <- round(SSTeamwGroup/tr(Mt-Mg),digits=3) 

MSMethodxTeamwGroup <- round(SSMethodxTeamwGroup/tr(Mmt-Mmg-Mt+Mg),digits=3) 
MSError <- round(SSError/tr(diag(l,a*b*c*r,a*b*c*r)-Mmt),digits=3) 

MSTotal <- round(SSTotal/tr(diag(l,a*b*c*r,a*b*c*r)-Mmu),digits=3) 


##### Calculating F ##### 

FMethod <- round(MSMethod/MSMethodxTeamwGroup,digits=3) 

FGroup <- round(MSGroup/MSTeamwGroup,digits=3) 

FMethodxGroup <- round(MSMethodxGroup/MSMethodxTeamwGroup,digits=3) 
FTeamwGroup <- round(MSTeamwGroup/MSError,digits=3) 
FMethodxTeamwGroup <- round(MSMethodxTeamwGroup/MSError,digits=3) 


##### Summary ##### 

sources<-rbind("Method", "Group","MethodxGroup","Team(Group)","MethodxTeam(Group)", 

"Error","Total") 

Values <- cbind("Source","Deg Frdm","SS","MS","F") 

SS1<- 

rbind(SSMethod,SSGroup,SSMethodxGroup, SSTeamwGroup, SSMethodxTeamwGroup,SSError,SSTotal) 
MS1<- 

rbind(MSMethod,MSGroup,MSMethodxGroup,MSTeamwGroup,MSMethodxTeamwGroup,MSError,"") 
DB1 <-rbind(tr(Mm-Mmu),tr(Mg-Mmu),tr(Mmg-Mm-Mg+Mmu),tr(Mt-Mg),tr(Mmt-Mmg-Mt+Mg), 
tr(diag(l,a*b*c*r,a*b*c*r)-Mmt),tr(diag(l,a*b*c*r,a*b*c*r)-Mmu)) 

FI <- rbind(FMethod,FGroup,FMethodxGroup,FTeamwGroup,FMethodxTeamwGroup,""") 
nestedfactrabc <-cbind(sources,DB 1 ,SS 1 ,MS 1 ,F1) 
outnestedfactorial <-rbind( Values,nestedfactrabc) 
outnestedfactorial 


Table 1. Matrix Multiplication 



m m 

M a 

M p 

M ap 

M r 

M ar 

M, 

m m 

M u 

M ^ 

M ft 

M ft 

M, 

M a 

M M 

M a 

M ^ 

M a 

M ^ 

M a 

Mp 

m m 

M ft 

M p 

Mp 

Mp 

Mp 

M a p 

M u 

M a 

M p 

M ap 

Mp 

M ap 

M y 

M M 

M ^ 

Mp 

Mp 

My 

My 

M ay 

M M 

M a 

Mp 

M ap 

M y 

M ay 
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